JOURNAL OF RESEARCH of the National Bureau of Standards— B. Mathematics and Mathematical Physics 

Vol. 65B, No. 1, January-March 1961 

Bound for the P-Condition Number of Matrices With 

Positive Roots 

Philip J. Davis, Emilie V. Haynsworth, and Marvin Marcus 

(September 27, 1960) 

Let a matrix A have positive roots 0<Xi<A 2 < . . . <X n . Upper and lower bounds 
for the P-condition number of A, P = X n /Xi, are given in terms of det A and one other 
symmetric function of the roots. 



1. Introduction 

Suppose A=(a,ij) is a nonsingular nXn matrix 
with roots, \i(i=l, . . . , n), ordered so that 

0<|Xi|<|X 2 i<.. .<|X n |. 

For a wide class of matrices A, the ratio |X n |/|Xi| 
gives a rough measure of the probahle accuracy of 
the computation of the inverse of A, or the solution 
of the system of equations for which A is the matrix 
of coefficients. 

This measure was evaluated in some detail by 
von Neumann and Goldstine [5] 1 and has been called 
by J. Todd [2, 3, 4] the P- condition number of the 
matrix, i.e., 



In general the accuracy of the results is in proportion 
to the reciprocal of P. 

In this paper we shall show that, if the roots of A 
are all positive with 



0<X!<X 2 <. . .<X„ 



(1) 



and the determinant of A, det A, is known, together 
with one other symmetric function of the roots, 
upper and lower bounds for P can be given in 
terms of these two known quantities. 

More precisely, if the characteristic polynomial 
for the matrix A is 



p( x )=x n — dx n lJ rc 2 x n 2 



(i=l f ... f n) (2) 



and we know c k and c n =(dct A), there exist positive 
upper and lower bounds for the ratio 



v 



(3) 



in terms of c k and c n . 

To simplify the notation in the statement and 
proof of the theorem we use the following device. 

i Figures in brackets indicate the literature references at the end of this paper. 



Let 

Cfc=c fc (Xi, • • ., K)=(tj «*(Xi . . ., K)={2 f*> ( 4 ) 

and divide each root of p(x) by (s k ) 1/k . (This is 
equivalent to dividing each element of the matrix by 
(s k y/ k .) We will call the new matrix "normalized 
with respect to k" or simply the normalized matrix. 
Since the condition number, P, is the ratio of two 
roots, P will not be affected by such a transformation. 
Let Die be the determinant of the normalized matrix. 
Then 

/ n \n/k 
\k) C » 



1> 



(5) 



(«*r 



^k 



From Hardy-Littlewood-Polya [1] the numbers s k 
as defined in (4) satisfy the inequalities 



si>sl f2 >sl /s >. . .>si n 



(6) 



(where equality holds only if all the X* are equal) 
hence, by (5), I) k <\. 

2. Statement and Proof of Theorem 

Theorem: If we have a matrix A=(a tj ) with 
characteristic polynomial {2) and if the constant term 
and the kth coefficient are known, the following bounds 
hold for P, 



I <P< 

Ly k 



l+Vi-zs- 1 , 

l—Jl-Dl- 1 ' 



(*=1, . . ., n-1) (7) 



where D is defined by (4) and (5). 

If k=l {i.e., the trace of the matrix is given) we can 
improve the upper bound: 



P< 



l-Vi-A' 



A= 



cl 



(8) 



Proof: Part 1 — Upper bound. We prove (8) 
first, as the method used for this can be applied in 
the proof of (7). 



Since P=\ n /\ u eq (8) is equivalent to 

4XiX n 



A< 



(Xi+Xj 2 



(9) 



13 



so from (5), we must show that 

(Xi+X,)^ / ft y 

4XiX w -\nJsJ 



(10) 



But the right side of (10) is (AJG n ) n where A n 
is the arithmetic mean, and G n the geometric mean, 
of Xi, . . ., X w . 

Thus, we can prove (10) by induction if we prove 
the following: 

Lemma: 

(A n y/A n+1 y+i (n) 

\G n J ~\G n +iJ 

where A n+1 and 6? w+1 are the arithmetic and geo- 
metric means of Xi, . . ., \ n , X w+i . (The X* are not 
assumed to be ordered with respect to size for this 
lemma.) 

Proof of lemma: Simplifying (11) we see we 
must prove 

^n + lAZ^An+i. (12) 

But (12) follows from the arithmetic-geometric mean 
inequality (cf., Hardy, Littlewood, Polya [1]), 



aid 2 






+a 2 + . . . +a n 



< i 



n-f-1 



n+l 



if we let a t =A n , (i=l, . . ., n) and a n+ .i = \ n +i. 

Thus (11) holds and (10) follows immediately 
using finite induction on n. 

To prove (7) we note that, by the inequalities (6), 

qW-1 

T) n ~ l <: T) n - 1 —-^ , 



so it suffices to show that 

(Xi+Xj 2 
4XA W -D\ 



n n-\ V S n ) 



(13) 



Now let mi^X^ 1 , . . . , m^=Xi l and we note that 
0<mi< • • • <Mr* and (13) becomes 



0*1 + /0 



»)'</ *1 (ML.. M») Y T H4) 



4miM. 



But (14) is the same as (10) with the /x's substituted 
for the X's, hence (13) holds and the general upper 
bound (7) is true for all values of k. 

Proof: Part 2 — Lower Bound. From the in- 
equalities (6), 



A= 



Q nik 



> 



where, by the definition (4) 

Hence 

1 <*?< xg SK\ n - 
D k ~s n -\r l K W 

which proves the lower bound in (7) . 



3. Some Remarks 

In reference [6] T. Kato gives an upper bound for 
the P-condition number which in our notation is 

^A 

This is larger than the bound given in (8) . 

We derive several consequences from (7), (8). 
Suppose that we have a matrix with positive roots, 

which is normalized so that c k =l i J, 

Then D k =det A, and we can write 



i.e., s k =l. 



1 



(det 4) 



g i+y i-(detA) --;, (15) 



l/U-l) — x — 



1— yJl-idetA) 71 - 1 



This implies that 

det A->0 if and only if P-^co } 

det A-+1 if and only if P->1. 

Hence, the det A behaves essentially as the reciprocal 
of the P-condition number and may be said to con- 
stitute a reasonable condition number of its own 
for such matrices. This lends substance to the 
popular feeling that for a properly defined class of 
normalized matrices the smallness of its determinan b 
is accompanied by difficulty in inversion. 

A second observation relates to least square ap- 
proximations. Express the general problem in the 
language of inner product spaces. We are required 
to solve \\y— 2 T t= 1 a i x i \\=mmimvLm, where y is given 
and Xi are independent elements. The normal equa- 
tions have matrix ((x iy Xj)) where (p,q) designates 
the inner product. We can assume x x normalized: 
{x t ,Xi) = \[Xi\\ 2 =l. The Gram matrix ({x^X})) is 
positive definite symmetric and hence falls within 
the scope of our inequality with k= 1, Ci=n. Hence 
if G is the Gram determinant |fe,x ; )| we have 



1 



G 



!/(»-!) — 



<P< 



i+yr^£ 
i-Vi 1 ^' 



(16) 



The quantity G, which acts as a "measure" of linear 
independence of o^i, . . . , x ni therefore also serves 
as a condition number for the normal equations. 
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